MAESTRO: Dynamic Runtime Power and Concurrency Adaptation
Abstract
Microprocessor core counts are on a trajectory such that, for many applications, the microprocessor will soon be overprovisioned for computation relative to off-chip communication and memory system capabilities. In the absence of strategies to exploit intra-chip communication, the performance of even simple benchmarks is noticeably degraded by coscheduling multiple copies on the cores of current systems, and the problem will get worse. We are developing the MAESTRO runtime system to experiment with the idea of dedicating excess computational resources to (1) reduce the power consumption of memory-bound applications with little performance impact, (2) provide guidance to adaptive concurrency control to improve performance, and (3) isolate performance monitoring and other system activities to reduce the effects of “jitter” on tightly coupled applications. We describe initial experiments in which we dedicate one core to monitoring chip-wide bottlenecks using hardware performance counters. When bottlenecks in the memory system are approached, the frequency (and power) of the cores is reduced. The MAESTRO prototype runs on an AMD Phenom as a Linux daemon pinned to one core, while applications are pinned to the remaining cores. On a desktop system, running at low frequency can save up to 36% of the total power consumed with minor performance degradation. MAESTRO allows single-core jobs to compute at full frequency and saves power when bottlenecks exist during parallel execution. We present motivating experiments, describe aspects of the working prototype of MAESTRO, and present some early results.
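The throttling idea in the abstract (lower the core frequency as the memory system nears saturation, restore it otherwise) can be illustrated with a small decision function. This is only a sketch under stated assumptions, not the MAESTRO implementation: the names `ThrottlePolicy`, `next_frequency`, and `simulate`, the two watermarks, the frequency values, and the use of hysteresis between the watermarks are all hypothetical. The input is assumed to be a memory-bandwidth utilization fraction that a monitoring core has already derived from hardware performance counters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ThrottlePolicy:
    # Every value here is an illustrative assumption, not a figure from the paper.
    high_watermark: float = 0.85  # utilization treated as "bottleneck approached"
    low_watermark: float = 0.60   # utilization below which full speed is restored
    full_khz: int = 2_200_000     # hypothetical full core frequency
    reduced_khz: int = 1_100_000  # hypothetical reduced core frequency


def next_frequency(current_khz: int, bandwidth_fraction: float,
                   policy: ThrottlePolicy) -> int:
    """Pick the frequency for the next interval from one utilization sample."""
    if bandwidth_fraction < 0.0:
        raise ValueError("bandwidth_fraction must be non-negative")
    if bandwidth_fraction >= policy.high_watermark:
        return policy.reduced_khz  # memory-bound: slowing the cores costs little
    if bandwidth_fraction <= policy.low_watermark:
        return policy.full_khz     # compute-bound or lightly loaded: run at full speed
    return current_khz             # between watermarks: keep the current setting


def simulate(samples, policy: ThrottlePolicy = ThrottlePolicy()):
    """Apply the policy to a sequence of utilization samples, starting at full speed."""
    freq = policy.full_khz
    trace = []
    for sample in samples:
        freq = next_frequency(freq, sample, policy)
        trace.append(freq)
    return trace


if __name__ == "__main__":
    print(simulate([0.2, 0.9, 0.7, 0.5]))
```

Under these assumptions, a single-core job that generates little memory traffic stays below the low watermark and keeps running at full frequency, while a parallel run that pushes utilization past the high watermark is slowed until the pressure drops. The gap between the two watermarks is there to avoid rapid switching when utilization hovers near one threshold.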
Similar resources
Improving the Efficiency of Parallel Applications on Multithreaded and Multicore Systems
The scalability of parallel applications executing on multithreaded and multicore multiprocessors is often quite limited due to large degrees of contention over shared resources on these systems. In fact, negative scalability frequently occurs such that a non-negligible performance loss is observed through the use of more processors and cores. In this dissertation, we present a prediction model...
Preventing Key Performance Indicators Violations Based on Proactive Runtime Adaptation in Service Oriented Environment
Key Performance Indicator (KPI) is a type of performance measurement that evaluates the success of an organization or a partial activity in which it engages. If during the running process instance the monitoring results show that the KPIs do not reach their target values, then the influential factors should be identified, and the appropriate adaptation strategies should be performed to prevent ...
Energy-Efficient and Power-Constrained Techniques for Exascale Computing
The future of computing will be driven by constraints on power consumption. Achieving an exaflop will be limited to no more than 20 MW of power, forcing co-design innovations in both hardware and software to improve overall efficiency. On the hardware side, processor designs are shifting to many-core architectures to increase the ratio of computational power to power consumption. Research and d...
Optimization of majority protocol for controlling transactions concurrency in distributed databases by multi-agent systems
In this paper, we propose a new concurrency control algorithm based on multi-agent systems which is an extension of majority protocol. Then, we suggest a clustering approach to get better results in reliability, decreasing message passing and algorithm’s runtime. Here, we consider n different transactions working on non-conflict data items. Considering execution efficiency of some different...
On the Design of Online Predictors for Autonomic Power-Performance Adaptation of Multithreaded Programs
This paper investigates the design space for techniques that enable runtime, autonomic program adaptation for high-performance and low-power execution via event-driven performance prediction. The emerging multithreaded and multicore processor architectures enable applications to trade performance for reduced power consumption via regulating concurrency. At the same time however, power and perfo...
Publication date: 2008